NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

Zhang, Tianyuan; Yu, Hong-Xing; Wu, Rundi; Feng, Brandon Y; Zheng, Changxi; Snavely, Noah; Wu, Jiajun; Freeman, William T (October 2024, Springer Nature Link)

Full Text Available
PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

Zhang, Tianyuan; Yu, Hong-Xing; Wu, Rundi; Feng, Brandon Y; Zheng, Changxi; Snavely, Noah; Wu, Jiajun; Freeman, William T (October 2024, European Conference on Computer Vision (ECCV))

Full Text Available
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

Van_Hoorick, Basile; Wu, Rundi; Ozguroglu, Ege; Sargent, Kyle; Liu, Ruoshi; Tokmakov, Pavel; Dave, Achal; Zheng, Changxi; Vondrick, Carl (September 2024, European Conference on Computer Vision)

Full Text Available
Textureless Deformable Object Tracking with Invisible Markers

https://doi.org/10.1109/TPAMI.2024.3463422

Li, Xinyuan; Guo, Yu; Tu, Yubei; Ji, Yu; Liu, Yanchen; Ye, Jinwei; Zheng, Changxi (January 2024, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
Learning to Generate 3D Shapes from a Single Example

https://doi.org/10.1145/3550454.3555480

Wu, Rundi; Zheng, Changxi (December 2022, ACM Transactions on Graphics)

Existing generative models for 3D shapes are typically trained on a large 3D dataset, often of a specific object category. In this paper, we investigate the deep generative model that learns from only a single reference 3D shape. Specifically, we present a multi-scale GAN-based model designed to capture the input shape's geometric features across a range of spatial scales. To avoid large memory and computational cost induced by operating on the 3D volume, we build our generator atop the tri-plane hybrid representation, which requires only 2D convolutions. We train our generative model on a voxel pyramid of the reference shape, without the need of any external supervision or manual annotation. Once trained, our model can generate diverse and high-quality 3D shapes possibly of different sizes and aspect ratios. The resulting shapes present variations across different scales, and at the same time retain the global structure of the reference shape. Through extensive evaluation, both qualitative and quantitative, we demonstrate that our model can generate 3D shapes of various types. 1
more » « less
Full Text Available
Can one hear the shape of a neural network?: Snooping the GPU via Magnetic Side Channel

Maia, Henrique Teles; Xiao, Chang; Li, Dingzeyu; Grinspun, Eitan; Zheng, Changxi (August 2022, 31st USENIX Security Symposium (USENIX Security 22))

Neural network applications have become popular in both enterprise and personal settings. Network solutions are tuned meticulously for each task, and designs that can robustly resolve queries end up in high demand. As the commercial value of accurate and performant machine learning models increases, so too does the demand to protect neural architectures as confidential investments. We explore the vulnerability of neural networks deployed as black boxes across accelerated hardware through electromagnetic side channels. We examine the magnetic flux emanating from a graphics processing unit’s power cable, as acquired by a cheap $3 induction sensor, and find that this signal betrays the detailed topology and hyperparameters of a black-box neural network model. The attack acquires the magnetic signal for one query with unknown input values, but known input dimensions. The network reconstruction is possible due to the modular layer sequence in which deep neural networks are evaluated. We find that each layer component’s evaluation produces an identifiable magnetic signal signature, from which layer topology, width, function type, and sequence order can be inferred using a suitably trained classifier and a joint consistency optimization based on integer programming. We study the extent to which network specifications can be recovered, and consider metrics for comparing network similarity. We demonstrate the potential accuracy of this side channel attack in recovering the details for a broad range of network architectures, including random designs. We consider applications that may exploit this novel side channel exposure, such as adversarial transfer attacks. In response, we discuss countermeasures to protect against our method and other similar snooping techniques.
more » « less
Full Text Available
Penetration-free projective dynamics on the GPU

https://doi.org/10.1145/3528223.3530069

Lan, Lei; Ma, Guanqun; Yang, Yin; Zheng, Changxi; Li, Minchen; Jiang, Chenfanfu (July 2022, ACM Transactions on Graphics)

We present a GPU algorithm for deformable simulation. Our method offers good computational efficiency and penetration-free guarantee at the same time, which are not common with existing techniques. The main idea is an algorithmic integration of projective dynamics (PD) and incremental potential contact (IPC). PD is a position-based simulation framework, favored for its robust convergence and convenient implementation. We show that PD can be employed to handle the variational optimization with the interior point method e.g., IPC. While conceptually straightforward, this requires a dedicated rework over the collision resolution and the iteration modality to avoid incorrect collision projection with improved numerical convergence. IPC exploits a barrier-based formulation, which yields an infinitely large penalty when the constraint is on the verge of being violated. This mechanism guarantees intersection-free trajectories of deformable bodies during the simulation, as long as they are apart at the rest configuration. On the downside, IPC brings a large amount of nonlinearity to the system, making PD slower to converge. To mitigate this issue, we propose a novel GPU algorithm named A-Jacobi for faster linear solve at the global step of PD. A-Jacobi is based on Jacobi iteration, but it better harvests the computation capacity on modern GPUs by lumping several Jacobi steps into a single iteration. In addition, we also re-design the CCD root finding procedure by using a new minimum-gradient Newton algorithm. Those saved time budgets allow more iterations to accommodate stiff IPC barriers so that the result is both realistic and collision-free. Putting together, our algorithm simulates complicated models of both solids and shells on the GPU at an interactive rate or even in real time.
more » « less
Full Text Available
MoiréBoard: A Stable, Accurate and Low-cost Camera Tracking Method

https://doi.org/10.1145/3472749.3474793

Xiao, Chang; Zheng, Changxi (October 2021, The 34th Annual ACM Symposium on User Interface Software and Technology)

Camera tracking is an essential building block in a myriad of HCI applications. For example, commercial VR devices are equipped with dedicated hardware, such as laser-emitting beacon stations, to enable accurate tracking of VR headsets. However, this hardware remains costly. On the other hand, low-cost solutions such as IMU sensors and visual markers exist, but they suffer from large tracking errors. In this work, we bring high accuracy and low cost together to present MoiréBoard, a new 3-DOF camera position tracking method that leverages a seemingly irrelevant visual phenomenon, the moiré effect. Based on a systematic analysis of the moiré effect under camera projection, MoiréBoard requires no power nor camera calibration. It can be easily made at a low cost (e.g., through 3D printing), ready to use with any stock mobile devices with a camera. Its tracking algorithm is computationally efficient, able to run at a high frame rate. Although it is simple to implement, it tracks devices at high accuracy, comparable to the state-of-the-art commercial VR tracking systems.
more » « less
Full Text Available
DeepCAD: A Deep Generative Network for Computer-Aided Design Models

https://doi.org/10.1109/ICCV48922.2021.00670

Wu, Rundi; Xiao, Chang; Zheng, Changxi (October 2021, International Conference on Computer Vision)

Deep generative models of 3D shapes have received a great deal of research interest. Yet, almost all of them generate discrete shape representations, such as voxels, point clouds, and polygon meshes. We present the first 3D generative model for a drastically different shape representation—describing a shape as a sequence of computer-aided design (CAD) operations. Unlike meshes and point clouds, CAD models encode the user creation process of 3D shapes, widely used in numerous industrial and engineering design tasks. However, the sequential and irregular structure of CAD operations poses significant challenges for existing 3D generative models. Drawing an analogy between CAD operations and natural language, we propose a CAD generative network based on the Transformer. We demonstrate the performance of our model for both shape autoencoding and random shape generation. To train our network, we create a new CAD dataset consisting of 178,238 models and their CAD construction sequences. We have made this dataset publicly available to promote future research on this topic.
more » « less
Full Text Available
VarRCWA: An Adaptive High-Order Rigorous Coupled Wave Analysis Method

https://doi.org/10.1021/acsphotonics.2c00662

Zhu, Ziwei; Zheng, Changxi (September 2022, ACS Photonics)

« Prev Next »

Search for: All records